Uniformly Best Constant Risk and Minimax Point Estimates

Raymond P. Peterson 

In this paper several types of point estimates are compared on the basis of their corresponding expected risk. It is shown that constant risk minimax estimates (which are always uniformly best constant risk estimates) exist, under certain conditions, for several frequently occurring types of parameters, and general methods are obtained for constructing these minimax estimates.



1. Introduction 

Let $x_1, \ldots, x_n$ denote $n$ (not necessarily independent) observed values of a random variable $\xi$, which is distributed over a space $S$ according to a distribution $P_\xi(x, \theta_1, \ldots, \theta_s)$. It is assumed that $P_\xi(x, \theta_1, \ldots, \theta_s)$ is completely specified except for the $s$ unknown parameters $\theta_1, \ldots, \theta_s$. These parameters may be represented by a point $\theta=(\theta_1, \ldots, \theta_s)$ in the $s$-dimensional Euclidean parameter space $\Omega$. Also $X=(x_1, \ldots, x_n)$ is a point in the $n$-dimensional Euclidean sample space $M$. We shall assume that $P_\xi(x, \theta)$ is absolutely continuous, that is, $\xi$ possesses an integrable probability density function $g_\xi(x, \theta)$. Let $p(X, \theta)=p(x_1, \ldots, x_n, \theta)$ denote the joint probability density function of the observations at $X \in M$.

A statistical point estimate of a parameter $\theta_i$, which ranges over a subset $\omega_i$ of one-dimensional Euclidean space, is a function $f_i(X)$ of the sample values that takes on values in $\omega_i$. Let $W[f_i(X), \theta]$ be a nonnegative measurable function defined for all $\theta \in \Omega$ and $X \in M$. $W[f_i(X), \theta]$ is a weight function that represents the relative seriousness of taking $f_i(X)$ as the value of $\theta_i$ for any particular sample point $X$. The function

$$r_{f_i}(\theta)=\int_M W[f_i(X),\theta]\,p(X,\theta)\,dX$$

represents the risk or expected loss incurred by using $f_i(X)$ to estimate $\theta_i$ when $\theta$ is the true parameter point. Thus $r_{f_i}(\theta)$ is defined as the risk function of $f_i(X)$. The expected risk of $f_i(X)$, relative to an a priori distribution $\lambda(\theta)$ of $\theta$, is given by

$$R_{f_i}(\lambda)=\int_\Omega\int_M W[f_i(X),\theta]\,p(X,\theta)\,dX\,d\lambda(\theta).$$

We can now define the following classes of point estimates in terms of $r_{f_i}(\theta)$ and $R_{f_i}(\lambda)$.

(1) A minimax estimate of $\theta_i$ is one which minimizes $\sup_\theta r_{f_i}(\theta)$.

(2) A constant risk (CR) estimate $f_i$ of $\theta_i$ is one such that $r_{f_i}(\theta)$ is constant.

(3) A Bayes estimate of $\theta_i$, relative to an a priori distribution $\lambda(\theta)$, is one that minimizes $R_{f_i}(\lambda)$.

(4) A uniformly best (UB) estimate of $\theta_i$ is one that minimizes $R_{f_i}(\lambda)$ for all possible a priori distributions $\lambda(\theta)$.

(5) A uniformly best constant risk (UBCR) estimate of $\theta_i$ is one that is a UB-estimate among all CR-estimates.

(6) A constant risk minimax (CRM) estimate is an estimate that is both a CR and a minimax estimate.
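The risk function and the expected risk defined above can be approximated by simulation. The following is a minimal sketch, not part of the original paper: it assumes independent $N(\theta, 1)$ observations, squared-error weight function, and a three-point parameter grid, and the names `risk`, `sample_mean`, and `squared_error` are hypothetical.

```python
import random

def risk(estimator, loss, theta, n=5, trials=20000, seed=0):
    """Monte Carlo approximation of r_f(theta) = E[W(f(X), theta)].

    Assumption for illustration: X consists of n independent N(theta, 1) draws.
    """
    rng = random.Random(seed)  # same seed for every theta (common random numbers)
    total = 0.0
    for _ in range(trials):
        x = [rng.gauss(theta, 1.0) for _ in range(n)]
        total += loss(estimator(x), theta)
    return total / trials

def sample_mean(x):
    return sum(x) / len(x)

def squared_error(estimate, theta):
    return (estimate - theta) ** 2

grid = (-3.0, 0.0, 2.5)
risks = [risk(sample_mean, squared_error, theta) for theta in grid]

# Crude stand-in for sup over theta: the maximum over the grid.
max_risk = max(risks)
# Expected risk relative to a uniform a priori distribution on the grid.
expected_risk = sum(risks) / len(risks)

print(risks, max_risk, expected_risk)
```

Under these assumptions the estimated risk is close to $1/n$ at every grid point, so the sample mean behaves as a CR estimate and its maximum risk and expected risk coincide up to simulation error.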

It is evident that a UB-estimate is preferable to any other, provided that one can be obtained. We will show that in several important cases it is reasonable to restrict our choice to CRM-estimates, since they possess certain desirable properties and, in most cases, are relatively easy to obtain. The concepts of a risk function, expected risk, and minimax estimates used here are due to Wald [5 to 8].

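Let

$$\phi_{f_i}(X)=\int_\Omega W[f_i(X),\theta]\,p(X,\theta)\,d\theta. \tag{1}$$

Theorem 1.1. Let $f_i(X)$ be a CR estimate of $\theta_i$ and suppose that for any other estimate $\bar f_i(X)$ there exists a probability measure $\lambda(\theta)$ over $\Omega$ such that

$$\int_\Omega r_{f_i}(\theta)\,d\lambda(\theta)\le\int_\Omega r_{\bar f_i}(\theta)\,d\lambda(\theta). \tag{2}$$

Then $f_i(X)$ is a minimax estimate.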

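Proof. Let $\bar f_i(X)$ be any other estimate and let $\lambda(\theta)$ be a probability measure such that (2) is satisfied. Then

$$c-\sup_\theta r_{\bar f_i}(\theta)\le\int_\Omega\left[c-r_{\bar f_i}(\theta)\right]d\lambda(\theta)\le 0,$$

where $r_{f_i}(\theta)=c$ ($c$ a constant) and the last inequality is (2). Therefore

$$c=\sup_\theta r_{f_i}(\theta)\le\sup_\theta r_{\bar f_i}(\theta),$$

and the theorem follows.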

Corollary 1.1. Any CR-estimate which is a Bayes estimate relative to some probability measure $\lambda(\theta)$ is a minimax estimate.

Proof. This corollary follows immediately from Theorem 1.1.

Theorem 1.2. Any CR-estimate $f_i(X)$ that is a minimax estimate is a UBCR-estimate.

Proof. Let $\bar f_i(X)$ be any other CR-estimate. Then

$$R_{\bar f_i}(\lambda)-R_{f_i}(\lambda)=\int_\Omega\left[r_{\bar f_i}(\theta)-r_{f_i}(\theta)\right]d\lambda(\theta)=\bar c-c=\sup_\theta r_{\bar f_i}(\theta)-\sup_\theta r_{f_i}(\theta)\ge 0,$$

where $\lambda(\theta)$ is any probability measure over $\Omega$, $r_{f_i}(\theta)=c$, and $r_{\bar f_i}(\theta)=\bar c$ ($c$ and $\bar c$ constants). Thus $f_i(X)$ is a UBCR-estimate.

Theorem 1.3. Suppose that a CR-estimate $f_i(X)$ minimizes $\phi_{f_i}(X)$ for all $X \in M$ and that at least one of the following conditions (A) and (B) is satisfied:

(A) $\Omega$ is compact.

(B) $r_{f_i}(\theta)$ and $\phi_{f_i}(X)$ are uniformly convergent over $M$ and $\Omega$, respectively.

Then $f_i(X)$ is both a minimax and a UBCR-estimate of $\theta_i$.

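Proof. Let $\bar f_i(X)$ be any other estimate of $\theta_i$. Then

$$\phi_{\bar f_i}(X)-\phi_{f_i}(X)\ge 0 \tag{3}$$

for all $X \in M$. Let $M_q$ and $\Omega_j$ be compact subsets of positive measure of $M$ and $\Omega$, respectively, such that

$$M_q\subset M_{q+1},\quad q=1,2,\ldots;\qquad \Omega_j\subset\Omega_{j+1},\quad j=1,2,\ldots$$

and

$$\lim_{q\to\infty}M_q=M,\qquad \lim_{j\to\infty}\Omega_j=\Omega.$$

Since $W[f_i(X),\theta]$ is nonnegative and measurable, it follows from Fubini's Theorem (see, for example, [1] or [4]) that

$$\int_{M_q}\int_{\Omega_j}W[f_i(X),\theta]\,p(X,\theta)\,d\theta\,dX=\int_{\Omega_j}\int_{M_q}W[f_i(X),\theta]\,p(X,\theta)\,dX\,d\theta \tag{4}$$

for all $j$ and $q$. By hypothesis,

$$r_{f_i}(\theta)=c,$$

where $c$ is a constant.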

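Suppose first that condition (A) is satisfied; that is, $\Omega$ is compact. Then from (4),

$$\int_M\int_\Omega W[f_i(X),\theta]\,p(X,\theta)\,d\theta\,dX=\int_\Omega\int_M W[f_i(X),\theta]\,p(X,\theta)\,dX\,d\theta.$$

Since $\Omega$ is compact,

$$\int_M\left[\phi_{\bar f_i}(X)-\phi_{f_i}(X)\right]dX=\int_\Omega\left[r_{\bar f_i}(\theta)-r_{f_i}(\theta)\right]d\theta=\int_\Omega\left[r_{\bar f_i}(\theta)-c\right]d\theta\le\left[\sup_\theta r_{\bar f_i}(\theta)-c\right]\int_\Omega d\theta. \tag{5}$$

But it follows from (3) that the first integral in (5) is nonnegative and therefore

$$\sup_\theta r_{\bar f_i}(\theta)\ge c. \tag{6}$$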

Now suppose that $\Omega$ is not compact, but condition (B) is satisfied. From (3) it is seen that either

$$\phi_{\bar f_i}(X)-\phi_{f_i}(X)=0 \tag{7}$$

for all $X \in M$ or

$$\phi_{\bar f_i}(X)-\phi_{f_i}(X)\ge\epsilon>0 \tag{7a}$$

over some set $M'$ in $M$, where $M'$ has finite positive measure $m'$.

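First, consider the case where (7a) holds. Let $M_q$ always be taken so that $M'\subset M_q$. Then, from (3), (4), and (7a), it follows that

$$\lim_{j\to\infty}\int_{M_q}\int_{\Omega_j}\left\{W[\bar f_i(X),\theta]-W[f_i(X),\theta]\right\}p(X,\theta)\,d\theta\,dX=\int_{M_q}\int_\Omega\left\{W[\bar f_i(X),\theta]-W[f_i(X),\theta]\right\}p(X,\theta)\,d\theta\,dX$$

$$=\int_{M_q}\left[\phi_{\bar f_i}(X)-\phi_{f_i}(X)\right]dX\ge\epsilon m'.$$

Let $\bar W_i-W_i=W[\bar f_i(X),\theta]-W[f_i(X),\theta]$. Since, by hypothesis,

$$\int_M\left[\bar W_i-W_i\right]p(X,\theta)\,dX$$

is uniformly convergent over $M$, there exists a $q_0$ such that

$$\left|\int_{M_q}\left[\bar W_i-W_i\right]p(X,\theta)\,dX-\int_M\left[\bar W_i-W_i\right]p(X,\theta)\,dX\right|<\frac{\epsilon}{2},\qquad q\ge q_0. \tag{8}$$

Also since, by hypothesis,

$$\int_\Omega\left[\bar W_i-W_i\right]p(X,\theta)\,d\theta$$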
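is uniformly convergent over $\Omega$, there exists a $j_0$ such that

$$\left|\int_{\Omega_j}\left[\bar W_i-W_i\right]p(X,\theta)\,d\theta-\int_\Omega\left[\bar W_i-W_i\right]p(X,\theta)\,d\theta\right|<\frac{\epsilon m'}{2m_{q_0}},\qquad j\ge j_0, \tag{9}$$

where $m_{q_0}$ is the measure of the set $M_{q_0}$. Then, from (7a) and (9) it follows that

$$\int_{\Omega_j}\int_{M_{q_0}}\left[\bar W_i-W_i\right]p(X,\theta)\,dX\,d\theta=\int_{M_{q_0}}\int_{\Omega_j}\left[\bar W_i-W_i\right]p(X,\theta)\,d\theta\,dX$$

$$=\int_{M_{q_0}}\int_\Omega\left[\bar W_i-W_i\right]p(X,\theta)\,d\theta\,dX+\alpha\ge\epsilon m'-\frac{\epsilon m'}{2}=\frac{\epsilon m'}{2},$$

where, by (9), $|\alpha|\le\epsilon m'/2$. Therefore

$$\int_{\Omega_j}\int_{M_{q_0}}\left[\bar W_i-W_i\right]p(X,\theta)\,dX\,d\theta\ge\frac{\epsilon m'}{2}. \tag{10}$$

From (8) and (10) we have that

$$\int_{\Omega_j}\left\{\int_M\left[\bar W_i-W_i\right]p(X,\theta)\,dX+\frac{\epsilon}{2}\right\}d\theta\ge\frac{\epsilon m'}{2},\qquad j\ge j_0,$$

and thus

$$\int_{\Omega_j}\left[\sup_\theta r_{\bar f_i}(\theta)-c+\frac{\epsilon}{2}\right]d\theta\ge\frac{\epsilon m'}{2}. \tag{11}$$

If $\sup_\theta r_{\bar f_i}(\theta)-c<0$, (11) is impossible since $\epsilon$ is arbitrarily small, and therefore

$$\sup_\theta r_{\bar f_i}(\theta)-c\ge 0. \tag{12}$$

The proof that (12) is true for the case in which (7) holds is immediate. Hence, since $c=\sup_\theta r_{f_i}(\theta)$, $f_i(X)$ is a minimax estimate and by Theorem 1.2 is also a UBCR-estimate.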

2. Classes of CR-Estimates 

In this section we shall find classes of CR-estimates for several frequently occurring types of parameters.

2.1. L-Estimates of a Location Parameter

If $p(X,\theta)$ can be expressed in the form

$$p(x_1-\theta,\ldots,x_n-\theta) \tag{13}$$

then $\theta$ is called a location parameter. An estimate $f(x_1,\ldots,x_n)$ will be called an L-estimate of the location parameter $\theta$ provided

$$f(x_1+\tau,\ldots,x_n+\tau)=f(x_1,\ldots,x_n)+\tau \tag{14}$$

for any real $\tau$.

2.2. S-Estimates of a Scale Parameter

If $p(X,\theta)$ can be expressed in the form

$$\theta^{-n}\,p\!\left(\frac{x_1}{\theta},\ldots,\frac{x_n}{\theta}\right),\qquad\theta>0, \tag{15}$$

then $\theta$ is called a scale parameter. An estimate $f(x_1,\ldots,x_n)$ will be called an S-estimate of the scale parameter $\theta$ provided

$$f(\mu x_1,\ldots,\mu x_n)=\mu f(x_1,\ldots,x_n),\qquad\mu>0, \tag{16}$$

and

$$f(x_1,\ldots,x_n)>0. \tag{17}$$
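Conditions (14), (16), and (17) can be spot-checked numerically for a candidate estimate. The sketch below is illustrative only: the helper names `is_L_estimate`, `is_S_estimate`, and `root_mean_square` are hypothetical, and passing on finitely many samples, shifts, and scale factors is a necessary check, not a proof that the conditions hold everywhere.

```python
import math
import statistics

def is_L_estimate(f, samples, shifts, tol=1e-9):
    """Check f(x_1 + tau, ..., x_n + tau) == f(x_1, ..., x_n) + tau, condition (14)."""
    for x in samples:
        fx = f(x)
        for tau in shifts:
            if abs(f([xi + tau for xi in x]) - (fx + tau)) > tol:
                return False
    return True

def is_S_estimate(f, samples, scales, tol=1e-9):
    """Check f(mu * x) == mu * f(x) for mu > 0 and f(x) > 0, conditions (16) and (17)."""
    for x in samples:
        fx = f(x)
        if fx <= 0:
            return False
        for mu in scales:
            if abs(f([mu * xi for xi in x]) - mu * fx) > tol:
                return False
    return True

def root_mean_square(x):
    return math.sqrt(sum(v * v for v in x) / len(x))

samples = [[0.3, -1.2, 2.5, 0.9], [-4.0, -1.0, -2.5]]
shifts = [-2.0, 0.5, 10.0]
scales = [0.1, 2.0, 10.0]

mean_is_L = is_L_estimate(statistics.mean, samples, shifts)    # mean commutes with shifts
rms_is_S = is_S_estimate(root_mean_square, samples, scales)    # RMS is positive and scales
rms_is_L = is_L_estimate(root_mean_square, samples, shifts)    # RMS does not shift additively
mean_is_S = is_S_estimate(statistics.mean, samples, scales)    # mean can be negative

print(mean_is_L, rms_is_S, rms_is_L, mean_is_S)
```

On these test samples the sample mean passes the L-estimate check and fails the S-estimate check (it violates (17) on the second sample), while the root mean square does the opposite.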

2.3. L(S)-Estimates of a Location Parameter (Scale Parameter Unknown)

Suppose $p(X,\theta)$ is of the form

$$\theta_2^{-n}\,p\!\left(\frac{x_1-\theta_1}{\theta_2},\ldots,\frac{x_n-\theta_1}{\theta_2}\right),\qquad\theta_2>0, \tag{18}$$

where $\theta_1$ and $\theta_2$ are unknown parameters. Then an estimate $f_1(x_1,\ldots,x_n)$ will be called an L(S)-estimate of the location parameter $\theta_1$ (the scale parameter $\theta_2$ unknown) provided that $f_1(X)$ is an L-estimate, that is, (14) is satisfied, and also

$$f_1(\mu x_1,\ldots,\mu x_n)=\mu f_1(x_1,\ldots,x_n) \tag{19}$$

for any real $\mu$.

2.4. S(L)-Estimates of a Scale Parameter (Location Parameter Unknown)

Let $p(X,\theta)$ be of the form

$$\theta_2^{-n}\,p\!\left(\frac{x_1-\theta_1}{\theta_2},\ldots,\frac{x_n-\theta_1}{\theta_2}\right),\qquad\theta_2>0, \tag{20}$$

where $\theta_1$ and $\theta_2$ are unknown parameters. Then an estimate $f_2(x_1,\ldots,x_n)$ is called an S(L)-estimate of the scale parameter $\theta_2$ (the location parameter $\theta_1$ unknown) provided that $f_2(X)$ is an S-estimate, that is, $f_2(X)$ satisfies (16) and (17), and also

$$f_2(x_1+\tau,\ldots,x_n+\tau)=f_2(x_1,\ldots,x_n) \tag{21}$$

for all real $\tau$.

2.5. D-Estimates of the Difference Between Two Location Parameters

Let $p(X,Y,\theta,\delta)$ be the joint probability density function of $x_1,\ldots,x_m$ and $y_1,\ldots,y_n$, where $X$ and $Y$ are samples from two populations with unknown location parameters $\theta$ and $\theta+\delta$, respectively. Then $p(X,Y,\theta,\delta)$ is of the form

$$p(x_1-\theta,\ldots,x_m-\theta,\;y_1-\theta-\delta,\ldots,y_n-\theta-\delta). \tag{22}$$

An estimate $f(X,Y)$ will be called a D-estimate of the difference $\delta$ provided

$$f(x_1+\mu,\ldots,x_m+\mu,\;y_1+\lambda,\ldots,y_n+\lambda)=f(x_1,\ldots,x_m,y_1,\ldots,y_n)+(\lambda-\mu) \tag{23}$$

for all real $\mu$ and $\lambda$.

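2.6. R-Estimates of the Ratio of Two Scale Parameters

Let $p(X,Y,\theta,\rho)$ be of the form

$$\theta^{-m}(\rho\theta)^{-n}\,p\!\left(\frac{x_1}{\theta},\ldots,\frac{x_m}{\theta},\;\frac{y_1}{\rho\theta},\ldots,\frac{y_n}{\rho\theta}\right),\qquad\theta,\rho>0, \tag{24}$$

where $X$ and $Y$ are samples from two populations with unknown scale parameters $\theta$ and $\rho\theta$, respectively. An estimate $f(X,Y)$ is called an R-estimate of the ratio $\rho$ provided

$$f(\mu x_1,\ldots,\mu x_m,\;\lambda y_1,\ldots,\lambda y_n)=\frac{\lambda}{\mu}\,f(x_1,\ldots,x_m,y_1,\ldots,y_n) \tag{25}$$

for all $\mu,\lambda>0$.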

We now show that any estimate belonging to one of the classes 2.1 to 2.6 is a CR-estimate provided that the weight function $W$ is of proper form. The following six theorems are stated as one.

Theorems 2.1 to 2.6. Let the density function $p$ be of the form given in classes 2.1 to 2.6 and let the weight function $W$ be of the form (1) $W[f(X)-\theta]$, (2) $W[\theta^{-1}f(X)]$, (3) $W[\theta_2^{-1}(f_1(X)-\theta_1)]$, (4) $W[\theta_2^{-1}f_2(X)]$, (5) $W[f(X,Y)-\delta]$, (6) $W[\rho^{-1}f(X,Y)]$. Then, if $f_i$ is an (1) L, (2) S, (3) L(S), (4) S(L), (5) D, (6) R-estimate, the risk function $r_{f_i}(\theta)$ is constant.

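Proof. We shall prove only Theorem 2.3, as the others are proved in exactly the same manner. Consider the risk function

$$r_{f_1}(\theta)=\int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty}W\!\left[\frac{f_1(X)-\theta_1}{\theta_2}\right]\theta_2^{-n}\,p\!\left(\frac{x_1-\theta_1}{\theta_2},\ldots,\frac{x_n-\theta_1}{\theta_2}\right)dx_1\cdots dx_n. \tag{26}$$

Let

$$t_i=\frac{x_i-\theta_1}{\theta_2},\qquad(i=1,2,\ldots,n). \tag{27}$$

Since $f_1(X)$ is an L(S)-estimate,

$$\frac{f_1(X)-\theta_1}{\theta_2}=\frac{f_1(x_1,\ldots,x_n)-\theta_1}{\theta_2}=f_1\!\left(\frac{x_1-\theta_1}{\theta_2},\ldots,\frac{x_n-\theta_1}{\theta_2}\right)=f_1(t_1,\ldots,t_n). \tag{28}$$

Thus, making the transformation (27) and using (28) in (26), we have

$$r_{f_1}(\theta)=\int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty}W[f_1(t_1,\ldots,t_n)]\,p(t_1,\ldots,t_n)\,dt_1\cdots dt_n,$$

which is completely independent of $\theta$, and therefore $r_{f_1}(\theta)$ is constant.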
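The change of variables (27) can be mimicked in a simulation by generating the standardized values $t_i$ first and then forming $x_i=\theta_1+\theta_2 t_i$. The sketch below is an illustration under assumptions not made in the paper: the $t_i$ are independent standard normal, the weight function is the square of $(f_1(X)-\theta_1)/\theta_2$, and $f_1$ is the sample mean (an L(S)-estimate). The function name `risk_LS` is hypothetical.

```python
import random

def risk_LS(theta1, theta2, n=6, trials=5000, seed=1):
    """Monte Carlo estimate of E[((f1(X) - theta1) / theta2)^2] with f1 the sample mean.

    Observations are built as x_i = theta1 + theta2 * t_i, which is the inverse
    of transformation (27); the t_i are assumed standard normal for illustration.
    """
    rng = random.Random(seed)  # identical t_i for every (theta1, theta2)
    total = 0.0
    for _ in range(trials):
        t = [rng.gauss(0.0, 1.0) for _ in range(n)]
        x = [theta1 + theta2 * ti for ti in t]
        f1 = sum(x) / n
        total += ((f1 - theta1) / theta2) ** 2
    return total / trials

parameter_points = [(0.0, 1.0), (5.0, 0.5), (-2.0, 3.0)]
ls_risks = [risk_LS(th1, th2) for th1, th2 in parameter_points]
print(ls_risks)
```

Because the same standardized draws are reused at every parameter point, the three estimated risks agree up to floating-point rounding, which mirrors the statement that the transformed integral does not involve $\theta$.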

3. CR-Minimax Estimates 

As a direct consequence of Theorem 1.3 and Theorems 2.1 to 2.6 we have the following six theorems, which are stated as one.

Theorems 3.1 to 3.6. If at least one of conditions (A) and (B) in Theorem 1.3 is satisfied and if the weight function $W$ is of the form (1) $W[f(X)-\theta]$, (2) $W[\theta^{-1}f(X)]$, (3) $W[\theta_2^{-1}(f_1(X)-\theta_1)]$, (4) $W[\theta_2^{-1}f_2(X)]$, (5) $W[f(X,Y)-\delta]$, (6) $W[\rho^{-1}f(X,Y)]$, then any (1) L, (2) S, (3) L(S), (4) S(L), (5) D, (6) R-estimate which minimizes $\phi_f(X)$ [as defined by formula (1)] for all $X$, $(X,Y)\in M$ is a minimax (and also a UBCR) estimate of $\theta_i$.

Conversely, it has been shown by Kallianpur [3] that, "under mild restrictions", the minimax estimate in the above cases minimizes $\phi_{f_i}$ and also belongs to the corresponding class of CR-estimates. For example, in case 3.3, the minimax estimate of $\theta_1$ minimizes $\phi_{f_1}(X)$ given in Theorem 3.3 and is an L(S)-estimate.

4. Determination of General Classes of 
CR-Estimates 

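Suppose the joint probability density function $p(X,\theta)$ is of the form

$$p(X,\theta)=h(\eta(x_1,\theta),\ldots,\eta(x_n,\theta))\prod_{i=1}^{n}\frac{\partial\eta(x_i,\theta)}{\partial x_i},$$

where $\eta(x_i,\theta)$ possesses the $n$ first partial derivatives $\partial\eta(x_i,\theta)/\partial x_i$ continuous in $x_i$, $(i=1,2,\ldots,n)$. Let $M$ be an $n$-dimensional interval $(a_i<x_i<b_i)$ such that

$$\eta(a_i,\theta)=c_i,\qquad\eta(b_i,\theta)=d_i,\qquad(i=1,2,\ldots,n), \tag{29}$$

where the $a_i$, $b_i$, $c_i$, and $d_i$ are constants (possibly infinite) which are independent of $\theta$. Let $\zeta(x,y)$ and $\psi(\theta)$ be arbitrary functions such that the weight function $V[\zeta(f_i(X),\psi(\theta))]$ is nonnegative and measurable over the product space $M\times\Omega$. Then we define the risk function of an estimate $f_i(X)$ of $\theta_i$ to be

$$r_{f_i}(\theta)=\int_M V[\zeta(f_i(X),\psi(\theta))]\,p(X,\theta)\,dX=\int_M V[\zeta(f_i(X),\psi(\theta))]\,h(\eta(x_1,\theta),\ldots,\eta(x_n,\theta))\prod_{i=1}^{n}\frac{\partial\eta(x_i,\theta)}{\partial x_i}\,dx_i. \tag{30}$$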

The following theorem yields a method for determining general classes of CR-estimates, that is, estimates that possess constant risk functions.

Theorem 4.1. If $f_i(X)=f_i(x_1,\ldots,x_n)$ is such that

$$f_i(\eta(x_1,\theta),\ldots,\eta(x_n,\theta))=\zeta(f_i(X),\psi(\theta)) \tag{31}$$

then the risk function $r_{f_i}(\theta)$ is constant.

Proof. Let $f_i(X)$ be any estimate satisfying (31) and let

$$t_i=\eta(x_i,\theta),\qquad(i=1,2,\ldots,n). \tag{32}$$

Then, from (31) and (32) we have

$$\zeta(f_i(X),\psi(\theta))=f_i(t_1,\ldots,t_n). \tag{33}$$

Let $T$ denote the $n$-dimensional interval $(c_i<t_i<d_i)$. Applying the transformation (32) and using (29) and (33) in (30) we have

$$r_{f_i}(\theta)=\int_M V[\zeta(f_i(X),\psi(\theta))]\,h(\eta(x_1,\theta),\ldots,\eta(x_n,\theta))\prod_{i=1}^{n}\frac{\partial\eta(x_i,\theta)}{\partial x_i}\,dx_i=\int_T V[f_i(t_1,\ldots,t_n)]\,h(t_1,\ldots,t_n)\,dt_1\cdots dt_n=C,$$

where $C$ is a constant.
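Condition (31) can be spot-checked numerically for a proposed estimate once the transformation, the comparison function, and the parameter function have been chosen. The sketch below is illustrative only: `satisfies_31` and its argument names `eta`, `zeta`, and `psi` are hypothetical labels for the three functions in (31), and the location-type choices used in the usage lines are one assumed instance. Agreement at a few parameter values does not prove the identity in general.

```python
import math
import statistics

def satisfies_31(f, eta, zeta, psi, x, thetas, tol=1e-9):
    """Check f(eta(x_1, theta), ..., eta(x_n, theta)) == zeta(f(x), psi(theta)).

    The identity is tested only at the supplied sample x and parameter values.
    """
    fx = f(x)
    for theta in thetas:
        lhs = f([eta(xi, theta) for xi in x])
        rhs = zeta(fx, psi(theta))
        if abs(lhs - rhs) > tol:
            return False
    return True

# Assumed location-type instance: eta subtracts theta, zeta subtracts psi(theta).
def eta_shift(xi, theta):
    return xi - theta

def zeta_difference(estimate, parameter):
    return estimate - parameter

def psi_identity(theta):
    return theta

def root_mean_square(x):
    return math.sqrt(sum(v * v for v in x) / len(x))

x = [0.3, -1.2, 2.5, 0.9]
thetas = [-1.5, 0.25, 4.0]

median_ok = satisfies_31(statistics.median, eta_shift, zeta_difference, psi_identity, x, thetas)
rms_ok = satisfies_31(root_mean_square, eta_shift, zeta_difference, psi_identity, x, thetas)
print(median_ok, rms_ok)
```

With these choices the sample median passes the check and the root mean square fails it, so only the former is a candidate CR-estimate in the sense of Theorem 4.1 for this transformation.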

5. Examples 

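As an example to illustrate the usefulness of Theorem 4.1, let

$$p(X,\theta)=(2\pi)^{-n/2}\,e^{-\frac{1}{2}\sum_{i=1}^{n}(x_i-\theta)^2}$$

and choose $V=(f-\theta)^2$. In this case we choose

$$\eta(x_i,\theta)=x_i-\theta,\qquad\psi(\theta)=\theta,\qquad\zeta(f(X),\psi(\theta))=f(X)-\theta.$$

Then (31) becomes

$$f(x_1-\theta,\ldots,x_n-\theta)=f(x_1,\ldots,x_n)-\theta. \tag{34}$$

Thus, for example, any weighted mean $\sum_{i=1}^{n}a_i x_i$, $\sum_{i=1}^{n}a_i=1$, is a CR-estimate of $\theta$.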
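For weighted means the constant risk can be written down directly. Assuming independent unit-variance normal observations and squared-error weight function, the risk of $\sum a_i x_i$ with $\sum a_i=1$ is $\sum a_i^2$ for every $\theta$. The short sketch below evaluates this; the function name `weighted_mean_risk` and the two weight vectors are hypothetical.

```python
def weighted_mean_risk(weights):
    """Risk of sum(a_i * x_i) under squared error, assuming independent
    unit-variance observations and weights that sum to one.

    The variance of the weighted mean is sum(a_i^2), and the estimate is
    unbiased when the weights sum to one, so the risk does not depend on theta.
    """
    if abs(sum(weights) - 1.0) > 1e-12:
        raise ValueError("weights must sum to one for a CR-estimate")
    return sum(a * a for a in weights)

n = 4
equal_weights = [1.0 / n] * n
uneven_weights = [0.7, 0.1, 0.1, 0.1]

equal_risk = weighted_mean_risk(equal_weights)
uneven_risk = weighted_mean_risk(uneven_weights)
print(equal_risk, uneven_risk)
```

Within this subclass of CR-estimates the equal weights give the smallest constant risk, since $\sum a_i^2\ge 1/n$ whenever $\sum a_i=1$; this is the kind of comparison among CR-estimates that Theorem 1.2 formalizes.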



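If we take

$$p(X,\theta)=\frac{1}{(2\pi\theta)^{n/2}}\,e^{-\frac{1}{2\theta}\sum_{i=1}^{n}x_i^2}$$

and choose a weight function $V$ that depends on $f$ and $\theta$ only through $f/\theta$, then we can take

$$\eta(x_i,\theta)=\frac{x_i}{\theta^{1/2}},\qquad\psi(\theta)=\theta,\qquad\zeta(f(X),\psi(\theta))=\frac{f(X)}{\theta},$$

and (31) becomes

$$f\!\left(\frac{x_1}{\theta^{1/2}},\ldots,\frac{x_n}{\theta^{1/2}}\right)=\frac{f(x_1,\ldots,x_n)}{\theta}. \tag{35}$$

On the other hand, if we are estimating $\theta^{1/2}$ by $f(X)$, we take $\psi(\theta)=\theta^{1/2}$ and (31) becomes

$$f\!\left(\frac{x_1}{\theta^{1/2}},\ldots,\frac{x_n}{\theta^{1/2}}\right)=\frac{f(x_1,\ldots,x_n)}{\theta^{1/2}}. \tag{36}$$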


In these examples, conditions (34), (35), and (36) show the reasonableness and generality of the respective classes of CR-estimates.

Theorem 4.1 together with Theorem 1.3 can be used to obtain many results similar to Theorems 3.1 to 3.6, that is, to construct CR-minimax estimates and to throw light on their general desirability. The lower bound for $R_{f_i}(\lambda)$, where $f_i$ is any CR-estimate, is readily seen to be $r_{\bar f_i}(\theta)$, where $\bar f_i$ is a CR-minimax estimate. This lower bound is

$$R_{\bar f_i}(\lambda)=\int_\Omega r_{\bar f_i}(\theta)\,d\lambda(\theta)=\bar c,$$

where $\bar c$ is a constant.

In a recent paper [2], Hodges and Lehmann have illustrated some properties of minimax estimates. They mention, for example, that it has not been possible to obtain a general comparison between minimax estimates and unbiased estimates with uniformly smallest variance, if such exist. We can, imposing certain restrictions on the form of the probability density function $p(X,\theta)$, obtain the CR-minimax estimates with uniformly minimum variance and show that these estimates are unbiased. Also, a relationship exists between CR-minimax and maximum likelihood estimates.